Recursive Stochastic Games with Positive Rewards

نویسندگان

  • Kousha Etessami
  • Dominik Wojtczak
  • Mihalis Yannakakis
چکیده

We study the complexity of a class of Markov decision processes and, more generally, stochastic games, called 1-exit Recursive Markov Decision Processes (1-RMDPs) and Simple Stochastic Games (1-RSSGs) with strictly positive rewards. These are a class of finitely presented countable-state zero-sum stochastic games, with total expected reward objective. They subsume standard finite-state MDPs and Condon’s simple stochastic games and correspond to optimization and game versions of several classic stochastic models, with rewards. Such stochastic models arise naturally as models of probabilistic procedural programs with recursion, and the problems we address are motivated by the goal of analyzing the optimal/pessimal expected running time in such a setting. We give polynomial time algorithms for 1-exit Recursive Markov decision processes (1-RMDPs) with positive rewards. Specifically, we show that the exact optimal value of both maximizing and minimizing 1-RMDPs with positive rewards can be computed in polynomial time (this value may be ∞). For two-player 1-RSSGs with positive rewards, we prove a “stackless and memoryless” determinacy result, and show that deciding whether the game value is at least a given value r is in NP ∩ coNP. We also prove that a simultaneous strategy improvement algorithm converges to the value and optimal strategies for these stochastic games. We observe that 1-RSSG positive reward games are “harder” than finite-state SSGs

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discounted stochastic games poorly approximate undiscounted ones

The purpose of this note is to summarize recent results [6, 4, 5] on stochastic games by the author and his collaborators. These results have appeared or will appear in proceedings of computer science conferences. The intended reader of this note has an interest in finite stochastic games, but no particular interest in computation, that is, the computational aspects of the results obtained have...

متن کامل

Uniform Equilibrium: More Than Two Players

Until this day, no counter example was found. Furthermore, we have seen that a positive answer was given for several special classes, including recursive games (Everett, 1957), zero-sum games (Mertens and Neyman, 1981), two-player absorbing games (Vrieze and Thuijsman, 1989) and two-player non zero-sum games (Vieille, 1997b). For n-player games, existence of stationary equilibrium profiles was ...

متن کامل

On the computational complexity of solving stochastic mean-payoff games

We consider some well known families of two-player, zero-sum, turn-based, perfect information games that can be viewed as specical cases of Shapley’s stochastic games. We show that the following tasks are polynomial time equivalent: • Solving simple stochastic games, • solving stochastic mean-payoff games with rewards and probabilities given in unary, and • solving stochastic mean-payoff games ...

متن کامل

Stochastic Games and Related Classes

For n-person perfect information stochastic games and for n-person stochastic games with Additive Rewards and Additive Transitions (ARAT) we show the existence of pure limiting average equilibria. Using a similar approach we also derive the existence of limiting average "-equilibria for two-person switching control stochastic games. The orderreld property holds for each of the classes mentioned...

متن کامل

The Complexity of Solving Stochastic Games on Graphs

We consider some well-known families of two-player zero-sum perfect-information stochastic games played on finite directed graphs. Generalizing and unifying results of Liggett and Lippman, Zwick and Paterson, and Chatterjee and Henzinger, we show that the following tasks are polynomial-time (Turing) equivalent. – Solving stochastic parity games, – Solving simple stochastic games, – Solving stoc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008